Post - Selection Inference Online Appendix

نویسندگان

  • Richard Berk
  • Lawrence Brown
  • Andreas Buja
  • Kai Zhang
  • Linda Zhao
چکیده

B.1 The Full Model Interpretation of Parameters. In the full model interpretation, coefficients always have the fixed meaning as full model parameters. Variable selection then means setting some coefficient estimates to zero, and these estimates always exist for all predictors, irrespective of whether they are selected or deselected. The full model interpretation of parameters is appropriate, for example, if the full model is viewed as “data generating” and the predictors are hence causal for the response, or if the full model describes a physical system where the full set of predictors is needed to capture the system fully. In such situations it is natural to consider the coefficients in the full model as targets of estimation, even though “nature” may choose to set some of them to zero. This view is meaningful for example in tomography applications where the variables constitute voxels and their coefficients are rates of absorption, hence variable selection amounts to selection of voxels with high absorption. The use of the selected voxels is for display and medical diagnosis, and there is no meaning in interpreting these voxels as constituting a submodel. If full model parameters are estimated by forcing some of them to zero and estimating the remainder via least squares, then the result is a type of shrinkage estimator for the full model parameters. Such estimators are often referred to as “preliminary test estimators”; see Saleh (2006) for a comprehensive treatment and many references. These estimators are also closely related to more recently studied “hard threshold” and “soft threshold” estimators; for a taste of the extensive literature on these and related estimators see Tsybakov (2009) and references therein. The “submodel” corresponding to non-zero parameter estimates is viewed as a computational compression and a parsimonious statistical summary of the data, but it is viewed neither as a model in its own right nor as an object of future scientific research. Inferential problems with the full model view of variable selection are pointed out by the “Vienna School” in the series of articles referenced in

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Post-Selection and Post-Regularization Inference in Linear Models with Very Many Controls and Instruments

In this note, we offer an approach to estimating structural parameters in the presence of many instruments and controls based on methods for estimating sparse high-dimensional models. We use these high-dimensional methods to select both which instruments and which control variables to use. The approach we take extends Belloni et al. (2012), which covers selection of instruments for IV models wi...

متن کامل

Online Appendix – NOT FOR PUBLICATION

This Online Appendix is divided into six parts. Appendix A provides a general statement of strategies and equilibria, and describes how we do equilibrium selection when there are multiple equilibria. Appendix B proves Proposition 3. Appendix C proves verbal statements in the analysis of experimentation about the principal’s commitment problem and institutional solutions. Appendix D contains acc...

متن کامل

APPENDIX 1: Making sense out of Akaike’s Information Criterion (AIC): its use and interpretation in model selection and inference from ecological data

A large component of ecology consists of observational studies where the goal is to explain a pattern, such as the number of individuals in a patch, with a series of explanatory (i.e., independent) variables. To do so, ecologists have long relied on hypothesis testing to include or exclude variables in models, although the conclusions often depend on the approach used (e.g., forward, backward, ...

متن کامل

Online Appendix: Latent Factor Regressions for the Social Sciences

Appendix RoadMap In this appendix I provide additional details of materials omitted from the main paper. Appendix A includes a summary of technical contributions as well as details of the estimation algorithms. Appendices B-D provide additional insights into particular areas of the literature. Appendices E-F provide additional details on simulations and applications. A Variational Inference Alg...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013